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Abstract. When data do not conform to the hypothesis of a known sampling- 
variance, the fitting of a constant to the set of measured values is a long debated 
problem. Given the data, the fitting would require to find which measurand 
value is most probable. A fitting procedure is here reviewed which assigns 
probabilities to the possible measurand values, on the assumption that the 
uncertainty associated with each datum is the lower bound to the standard 
deviation. This procedure is applied to derive an estimate of the Planck constant. 
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1. Introduction 

Given a set of measured values of a constant and the associated uncertainties, the 
Gauss-Markov theorem states that the unbiased minimum- variance estimator of the 
measurand is the weighted mean [1] . The uncertainty of the mean, which is smaller 
than the smallest uncertainty, does not depend on data spread. This is a consequence 
of the assumption that the variance of the sampling distribution of each datum is 
exactly know. In practice, this hypothesis is often false and the inconsistency of the 
data - quantified, for example, by the \ 2 01 the Birge-ratio values - suggests that 
the uncertainties associated with the data are merely lower bounds to the standard 
deviations of their sampling distributions. In this case, the Gauss-Markov theorem is 
of no help and the choice of an optimal measurand value is long debated issue. The 
consequences of the assumption that the associated uncertainties are point estimates 
of the standard deviations are illustrated by Dose [2], who considers the estimate of 
the Newtonian constant of gravitation. 

Decision theory and probability calculus help to deal with the discrepancy 
between the quoted uncertainties and the data scatter. Given the measurement 
results and the lower bounds of the standard deviations, the first step is to find the 
probabilities of the possible measurand values. As described by Sivia [3J, who solved 
the related problem of dealing with outliers, probabilities are assigned by application of 
the product rule and marginalization. Next, given the loss due to a wrong decision, the 
optimal choice of the measurand value minimizes the expected loss over the assigned 
probabilities. Since foundations arc in the probability theory, this method makes it no 
longer necessary to exclude the data disagreeing with the majority, as well as to scale 
the uncertainties to make the data consistent. After reviewing the Sivia's analysis, 
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the procedure is here applied to estimate the value of the Planck constant from a set 
of inconsistent measurement results. 

2. Problem statement 

Let us consider a set of N measured values x% of a mcasurand h, which have 
been independently sampled from Gaussian distributions having unknown variance 
cf _t wf, where Ui is the uncertainty associated with the Xi datum. The problem 
is to hnd optimal estimates of the measurand value and of the confidence intervals. 
Clearly, we cannot rely on the weighted mean, because the actual variances of the 
sampling distributions are not known. On the other hand, we cannot go back to 
the arithmetic mean, because it leaves out significant information delivered by the 
associated uncertainties. 

3. Problem solution 

Given the data and their associated uncertainty, the solution is to assign probabilities 
to the h values and to use probabilities to minimize any given loss function [H [5J [5] . 

Firstly, we make probability assignments to the possible Ui values, for which only 
the Ui lower bounds are known. In the absence of any additional information, we 
assume that the probability distribution of Cj is independent of the measurement 
unit, that is, aP{a<r) = P(cr), where P(a) is the sought distribution of the a values 
[3]. This implies 



where the Heaviside function 9 (a — u) is 1 if a > u and zero if a < u and the i 
subscript has been dropped. The distribution (TTJ) is improper, but, provided the final 
measurand distribution is integrable, this is not a serious problem. Anyhow, we can 
always set a finite upper bound and delay the calculation of the upper bound limit to 
the infinity until last. 

Next, having stated P(cr;w), the sampling distribution of each x datum can be 
marginalized to eliminate the unknown variance. Hence, 



where N(x;h,a 2 ) is a normal distribution of the x values having h mean and a 
variance and erf is the error function. 

Eventually, we make pre-data probability assignments to the h values. In the 
absence of any additional information, we assume that they are independent of the 
unit-scale origin, which implies a uniform distribution. 

By application of the product rule of probability, the only post-data probability 
distribution of the measurand values, logically consistent with the pre-data 
assignments and sampling distributions, is 



where the Q function is given in ([2"]l. 1/Z is a normalization factor, Xi and Ui are the 
i-th datum and its uncertainty. 



P(cr;u) = 6{<j-u)/(t, 



(1) 




Q(h) = -IjQ(a*;fc,tii) 



(3) 
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The post-data distribution ([3]) is the central to our analysis. Given any loss 
L(h — h) due to a wrong estimate h , the optimal choice of the h value minimizes 

/+oo 
L(h -h)Q{h)dh. (4) 
-oo 

With a quadratic loss, the optimal estimate is the mean; with a linear loss, it is the 
median. With a loss independent of the error, it is the mode. Confidence intervals are 
easily calculable from ([3]). 

4. Planck constant estimate 

As an application example, let us consider the choice of the Planck constant value on 
the basis of the measurment results listed in table [TJ We neither presume to account 
for the interlinks between the data nor we presume to compete with the adjustments 
carried out by the committee on data for science and technology [?]■ 

The Planck constant links energy and momentum to the frequency and wavelength 
of the, classical or relativistic, wave-function. Therefore, h determinations match 
energy (momentum) and frequency (wavelength) measurements. According to whether 
an electrical or mechanical system is considered, the measurement result is a value of 
the h/e or h/m ratio, where e and m are the electron charge and mass. 



Table 1. Values of the Planck constant. The input data are from [7] and the 
references therein, with the exception of the last two, that are from [8]. The values 
calculated from the quotient of h and the neutron mass, the mass of a particle or 
of an atom, and the neutron binding energy depend on the same measured value 
of N A [5]. 



method 


h 1 10~ ;54 Js 


quotient of h and the electron charge 




h/(2e) (NMI 89) 


6.6260684(36) 


h/(2e) (PTB 91) 


6.6260670(42) 


watt-balance experiments 




h/rriR (NPL 1990) 


6.6260682(13) 


h/m R (NIST 1998 ) 


6.62606891(58) 


h/m R (NIST 2007) 


6.62606901(34) 


h/m R (METAS 2011) 


6.6260691(20) 


h/rriM (NPL 2010) 


6.6260712(13) 


quotient of h and the neutron mass 




N A h/M(n) 


6.62606887(52) 


quotient of h and the mass of a particle or of an atom 




N A h/M(e) 


6.62607003(20) 


N A h/M(Cs) 


6.62607000(22) 


N A h/M(Rb) 


6.62607011(22) 


weighted mean 


6.62607005(20) 


quotient of h and the neutron binding energy 




h/Am{ 29 Si) 


6.6260764(53) 


h/Am{ 33 S) 


6.6260686(34) 


weighted mean 


6.6260709(29) 
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Figure 1. Values of the Planck constant given in table [T] The reference is the 
CODATA 2006 value, h = 6.62606896 x 10" 34 Js [FJ. 



The electric determinations are based on the measurement of h/(2e) by the 
tunneling of Cooper's pairs in a Josephson junction. The watt-balance experiments 
accede directly to the h/rrifi ratio, where m& is the mass of the international kilogram 
prototype. The measurement of the Avogadro constant Na opened the way towards 
comparing the watt-balance measurements of h with the h/m quotients, where m 
is mass of a particle or of an atom. The h/m(n) ratio was determined by time- 
of-fiight measurements of monochromatic neutrons; optical spectroscopy and atom 
intcrfcrometry were instrumental to determine h/m(e), h/m(Cs), and h/m(Rb). 
Nuclear spectroscopy of 28 Si and 32 S, after the capture of a thermal neutron, allowed 
the NaIi product to be determined by comparing the mass defect with the frequencies 
of the 7 photons emitted in the cascades from the capture state to the ground state. 

Figure [1] shows the input data used in the analysis. The values calculated from 
the measurements of h/m(e), h/m(Cs), and h/m(Rb) have been averaged, because 
their uncertainty is mainly affected by the Na determination and, therefore, they are 
highly correlated. The values obtained from nuclear spectroscopy have been averaged 
because they were obtained by repetitions of the same experiment. 

Though none of them is totally out of scale, the data are inconsistent. The 
weighted mean is 6.62606960 x 10 -34 J/s with \ 2 = 14 for v = N — 1 = 9 degrees of 
freedom and Birge ratio Rb = \J X 2 l v — 1-26. In order to make the data consistent, it 
is usually assumed that the quoted uncertainties are wrong by a common scale factor. 
By multiplying the data uncertainties by the Birge ratio, the \ 2 of the weighted mean 



Table 2. Mode, mean, and median values of the Planck constant, given the data 
m table Q] The weighted-mean of the data — whose associated uncertainties have 
been expanded to make the Birge ratio unit - is also given, with the relevant 
standard deviation. The uncertainties associated to the mode and mean are the 
standard deviations of the post-data probability distribution; the sub- and super- 
script of the median indicate the extrema of the first and third quartilcs. 

mode 6.62606923(60) 
mean 6.62606937(60) 
median 6.62606931^2 



weighted mean 6.62606960(20) 
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Figure 2. Marginal distributions of the NIST 2007 and NPL 2010 measured 
values given lower bounds of the standard deviations equal to the associated 
uncertainties. The different areas under the curves are due to the improper 
character of these distributions. The measured values are also shown with their 
associated uncertainty. The reference value is the mode /irj = 6.62606923 X 10 — 34 
Js of the post-data distribution shown in Fig. [3] 



is 9 and the Birge ratio is 1; this removes the discrepancy between data spread and 
uncertainties. The weighted mean and the final associated uncertainty are given in 
table [3 

When assuming that the associated uncertainties are the lower bounds to 
unknown standard-deviations of the data, the probability calculus, as shown in section 
[3l allows a probability to be assigned to each possible h value. According to (|3]), the 
post-data distribution of the h values is 

0(/l)= zH 70 ■ (5) 

By using the ho = 6.62606923 x 10 -34 Js mode of the post-data distribution (the 




Figure 3. Post-data probability density of the values of the Planck constant 
calculated on the basis of the measurement results given in table [l] The input 
data are shown with their associated uncertainty. The reference value is the 
distribution mode, h = 6.62606923 X 10 -34 Js. 
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actual h value is unknown), the marginal sampling distributions, Q(x; ho, unistw) 
and Q{x; h , «nplio), of the NIST 2007 and NPL 2010 data arc shown in Fig. [2 The 
post-data distribution is shown in Fig. [3J the most probable h value and its mean and 
median are given in table [2] It is to be noted that all these estimates are smaller than 
the weighted mean and that the standard deviation of the post-data distribution is 
significantly greater than that of the weighted mean. 

5. Conclusions 

When data do not conform to the hypothesis of a known variance, the probability 
calculus offers a solution to the problem of fitting a constant to the set of measured 
values. By assigning probabilities to the possible mcasurand values, measurand values 
and confidence intervals can be optimally chosen. This approach is more rigorous than 
the search of estimators having optimal distributions for which no guideline exists. 
Assigning probabilities helps in the choice of a measurand value also when a traveling 
standard is used to compare the measurement capabilities of different laboratories. 
The post data distribution of the measurand values depends on the hypotheses made 
for the probability density of the unknown standard-deviation of data. The probability 
calculus allows different hypotheses to be compared; this will be the matter of future 
investigations. 
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